
[WIP] AI/LLM integration #1325

Draft
wants to merge 166 commits into
base: develop

Conversation

@perfectra1n (Member) commented Mar 3, 2025

Status:
(screenshot attached in the original PR)

Goals:

  • Create vector embeddings and store in the SQLite DB
  • Create an "index" to provide as initial context to LLMs
  • Allow a user to "chat" with their notes (emphasis on the CONTENT of the notes, relationships not so much)
  • Allow the use of Anthropic (Claude), OpenAI (ChatGPT), or Ollama

Out of scope:

  • The extremely complex relationships that Trilium supports, and being able to ask all the providers about them
  • Agentic tools (will be required for the above)
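For the first goal, SQLite has no native vector type, so a common approach is to serialize each embedding to a BLOB and deserialize it at query time. A minimal sketch with hypothetical helper names (the PR's actual storage layer may differ):

```typescript
// Embeddings are float vectors; pack them as little-endian float32 values
// so they can be stored in a SQLite BLOB column. Hypothetical helpers.

function serializeEmbedding(vector: number[]): Buffer {
    const buf = Buffer.alloc(vector.length * 4);
    vector.forEach((v, i) => buf.writeFloatLE(v, i * 4));
    return buf;
}

function deserializeEmbedding(buf: Buffer): number[] {
    const out: number[] = [];
    for (let i = 0; i < buf.length; i += 4) {
        out.push(buf.readFloatLE(i));
    }
    return out;
}
```

Note that float32 serialization is lossy for arbitrary doubles, which is usually acceptable for embeddings and halves the storage cost.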

@perfectra1n perfectra1n mentioned this pull request Mar 3, 2025
@eliandoran eliandoran marked this pull request as draft March 3, 2025 18:41
@eliandoran eliandoran mentioned this pull request Mar 8, 2025
@@ -369,6 +369,11 @@ class NoteContext extends Component implements EventListener<"entitiesReloaded">

const { note, viewScope } = this;

// For llmChat viewMode, show a custom title
if (viewScope?.viewMode === "llmChat") {
Contributor

Do we really need a specific view mode for chats? The idea is that view modes are supposed to be pretty generic, like viewing sources, in-app help, or attachments.

A good view mode candidate would be to view the text representation of notes, to see what the LLM "sees".


try {
// We'll use the Note Map approach - open a known note ID that corresponds to the LLM chat panel
await appContext.tabManager.openTabWithNoteWithHoisting("_globalNoteMap", {
Contributor

If we want to go with the note map approach, then we need a note specific for the LLM chat in the hidden subtree with a normal view.

@@ -27,6 +28,16 @@ bundleService.getWidgetBundlesByParent().then(async (widgetBundles) => {
});
console.error("Critical error occured", e);
});

// Initialize right pane tab manager after layout is loaded
setTimeout(() => {
Contributor

This feels kind of like a hack or a work-around, doesn't it? What problem does it solve?

Comment on lines 108 to 116
// Initialize the right pane tab manager after widget render
setTimeout(() => {
const $tabContainer = $("#right-pane-tab-container");
const $contentContainer = $("#right-pane-content-container");

if ($tabContainer.length && $contentContainer.length) {
rightPaneTabManager.init($tabContainer, $contentContainer);
}
}, 500);
Contributor

Same here, why are we running the code behind a delay?
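One delay-free alternative would be to wait for the containers to actually exist instead of guessing at 500 ms. A hypothetical sketch (not the PR's code) of a bounded poll that could replace the `setTimeout` call:

```typescript
// Poll until a readiness check passes, with a hard timeout so a missing
// element cannot hang initialization forever. The `check` callback would
// be something like `() => $("#right-pane-tab-container").length > 0`.
async function waitForElement(
    selector: string,
    check: (selector: string) => boolean,
    timeoutMs = 5000,
    intervalMs = 50,
): Promise<boolean> {
    const deadline = Date.now() + timeoutMs;
    while (Date.now() < deadline) {
        if (check(selector)) {
            return true;   // element is present, safe to init
        }
        await new Promise((resolve) => setTimeout(resolve, intervalMs));
    }
    return false;          // timed out; caller can log a warning
}
```

A layout-ready event fired by the widget system would be cleaner still, but the poll at least removes the fixed delay.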

@@ -0,0 +1,13 @@
function initComponents() {
// ... existing code ...
Contributor

Where's the existing code?

Member Author

🏃💨

Comment on lines +288 to +289
if (query.toLowerCase().includes("provide details about") ||
query.toLowerCase().includes("information related to")) {
Contributor

Maybe we should extract all these strings to translatable strings, or at least gather them as constants in a single place.

We need to be able to support multiple languages in the future, especially now that we added internationalization support to Trilium.
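A first step toward that could look like the following sketch (constant and function names are hypothetical, not the PR's code):

```typescript
// Gather the hard-coded trigger phrases in one place so they can later be
// swapped for i18next translation keys.
const QUERY_TRIGGER_PHRASES = [
    "provide details about",
    "information related to",
] as const;

function isDetailQuery(query: string): boolean {
    const q = query.toLowerCase();
    return QUERY_TRIGGER_PHRASES.some((phrase) => q.includes(phrase));
}
```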

Comment on lines +52 to +69
// Simple heuristics for common languages
if (firstLines.includes('<?php')) return 'php';
if (firstLines.includes('#!/usr/bin/python') || firstLines.includes('import ') && firstLines.includes('def ')) return 'python';
if (firstLines.includes('#!/bin/bash') || firstLines.includes('#!/usr/bin/bash')) return 'bash';
if (firstLines.includes('#!/usr/bin/perl')) return 'perl';
if (firstLines.includes('#!/usr/bin/ruby')) return 'ruby';
if (firstLines.includes('package ') && firstLines.includes('import ') && firstLines.includes('public class ')) return 'java';
if (firstLines.includes('using System;') && firstLines.includes('namespace ')) return 'csharp';
if (firstLines.includes('package main') && firstLines.includes('import (') && firstLines.includes('func ')) return 'go';
if (firstLines.includes('#include <') && (firstLines.includes('int main(') || firstLines.includes('void main('))) {
if (firstLines.includes('std::')) return 'cpp';
return 'c';
}
if (firstLines.includes('fn main()') && firstLines.includes('let ') && firstLines.includes('impl ')) return 'rust';
if (firstLines.includes('<!DOCTYPE html>') || firstLines.includes('<html>')) return 'html';
if (firstLines.includes('function ') && firstLines.includes('var ') && firstLines.includes('const ')) return 'javascript';
if (firstLines.includes('interface ') && firstLines.includes('export class ')) return 'typescript';
if (firstLines.includes('@Component') || firstLines.includes('import { Component }')) return 'typescript';
Contributor

Might want to add a TODO to integrate Highlight.js to do this sort of thing on its own, without having to hard-code it on our side.
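Until then, a data-driven table would at least keep the heuristics in one maintainable place instead of a chain of `if` statements. A sketch (marker sets illustrative, taken from the snippet above):

```typescript
// First entry whose markers ALL appear in the leading lines wins.
interface LanguageHint {
    language: string;
    markers: string[];
}

const LANGUAGE_HINTS: LanguageHint[] = [
    { language: "php",  markers: ["<?php"] },
    { language: "bash", markers: ["#!/bin/bash"] },
    { language: "go",   markers: ["package main", "func "] },
    { language: "rust", markers: ["fn main()", "let "] },
];

function detectLanguage(firstLines: string): string | null {
    const hit = LANGUAGE_HINTS.find((hint) =>
        hint.markers.every((marker) => firstLines.includes(marker)));
    return hit ? hit.language : null;
}
```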

Comment on lines 6 to 11
const CONTEXT_WINDOW = {
OPENAI: 16000,
ANTHROPIC: 100000,
OLLAMA: 8000,
DEFAULT: 4000
};
Contributor

Again, here it would be nice to define how these values were obtained and how we can maintain them, e.g. if we add a new AI provider.
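One way to make the values traceable is a registry entry per provider recording where each limit came from. A sketch reusing the numbers from the snippet above (the `source` strings are illustrative placeholders, not verified citations):

```typescript
// Per-provider context window with a note on provenance, so adding a new
// AI provider means adding one documented entry here.
interface ProviderLimits {
    contextWindow: number;
    source: string;  // where the figure was obtained (placeholder text)
}

const PROVIDER_LIMITS: Record<string, ProviderLimits> = {
    openai:    { contextWindow: 16000,  source: "provider model docs" },
    anthropic: { contextWindow: 100000, source: "provider model docs" },
    ollama:    { contextWindow: 8000,   source: "model-dependent default" },
};

function contextWindowFor(provider: string, fallback = 4000): number {
    return PROVIDER_LIMITS[provider]?.contextWindow ?? fallback;
}
```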

import cls from "../../../../services/cls.js";
import type { NoteEmbeddingContext } from "../types.js";
// Remove static imports that cause circular dependencies
// import { storeNoteEmbedding, deleteNoteEmbeddings } from "./storage.js";
Contributor

Commented-out code must be removed.

@@ -9,7 +9,7 @@ import searchService from "./search/services/search.js";
import SearchContext from "./search/search_context.js";
import hiddenSubtree from "./hidden_subtree.js";
import { t } from "i18next";
const { LBTPL_NOTE_LAUNCHER, LBTPL_CUSTOM_WIDGET, LBTPL_SPACER, LBTPL_SCRIPT } = hiddenSubtree;
const { LBTPL_NOTE, LBTPL_CUSTOM_WIDGET, LBTPL_SPACER, LBTPL_SCRIPT } = hiddenSubtree;
Contributor

Why this change?

@pano9000 (Member)

just a general comment: I saw quite a few uses of "any" as type
→ is there any chance you could try to reduce the number of "any"?

It looks like for some of these that should be rather trivial :-)
e.g. here:

async generateSearchQueries(userQuestion: string, llmService: any): Promise<string[]> {

The idea is to at some point have 0 uses of any :-)
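For the quoted signature, the `any` could be replaced with a narrow interface. The method name and shape below are hypothetical, inferred from the call site, not the PR's actual service contract:

```typescript
// A minimal interface capturing only what generateSearchQueries needs,
// instead of accepting `any`.
interface LLMService {
    complete(prompt: string): Promise<string>;
}

async function generateSearchQueries(
    userQuestion: string,
    llmService: LLMService,
): Promise<string[]> {
    const raw = await llmService.complete(
        `Generate search queries for: ${userQuestion}`);
    // One query per line; drop blanks and surrounding whitespace.
    return raw.split("\n").map((s) => s.trim()).filter(Boolean);
}
```

Typing the parameter this way also makes the function trivially testable with a stub service.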

const ollamaBaseUrl = baseUrl || await options.getOption('ollamaBaseUrl') || 'http://localhost:11434';

// Call Ollama API to get models
const response = await axios.get(`${ollamaBaseUrl}/api/tags?format=json`, {
@pano9000 (Member) commented Mar 26, 2025

I would propose to get rid of axios and just stick to the built-in fetch – from what I saw, we only have that dependency for the backend_script_api, where "axios" is marked as deprecated.


    /**
     * Axios library for HTTP requests. See {@link https://axios-http.com} for documentation
     * @deprecated use native (browser compatible) fetch() instead
     */
    axios: typeof axios;
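A sketch of what the fetch-based replacement could look like, with the URL building split out so it stays testable without a running Ollama instance (helper names hypothetical; built-in `fetch` requires Node 18+):

```typescript
// Build the Ollama tags endpoint URL, tolerating a trailing slash.
function ollamaTagsUrl(baseUrl: string): string {
    return `${baseUrl.replace(/\/$/, "")}/api/tags`;
}

// Drop-in for the axios.get() call above, using the global fetch.
async function listOllamaModels(
    baseUrl = "http://localhost:11434",
): Promise<unknown> {
    const response = await fetch(ollamaTagsUrl(baseUrl));
    if (!response.ok) {
        throw new Error(`Ollama returned HTTP ${response.status}`);
    }
    return response.json();
}
```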

@@ -369,6 +374,48 @@ function register(app: express.Application) {
etapiSpecRoute.register(router);
etapiBackupRoute.register(router);

// Embeddings API endpoints
Member

I have no idea about LLMs and all, but are "embeddings" something LLM-specific? If yes, wouldn't it make sense to have the API path reflect this as well?

e.g. by changing /api/embeddings/ to /api/llm/embeddings/

Member Author

An embedding is something like a baked croissant. Different bakers can make them with varying quality, but if you give the croissant to your friend at work, they may (or may not) think this croissant is better than one by another bakery.

Long-winded way of saying that even though LLMs generate them, we use them locally to do cosine similarity computation to see which notes to include at query time, and don't directly provide the embedding to the LLM we're talking to.

I’m not sure if you’re asking to move the endpoint to /api/<llm>/embeddings or /api/llm/embeddings - but yeah I think the latter makes more sense
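The local computation mentioned above is straightforward: rank each note's stored vector by cosine similarity against the query vector. A minimal sketch:

```typescript
// Cosine similarity between two equal-length vectors: 1 means identical
// direction, 0 means orthogonal (unrelated), -1 means opposite.
function cosineSimilarity(a: number[], b: number[]): number {
    let dot = 0;
    let normA = 0;
    let normB = 0;
    for (let i = 0; i < a.length; i++) {
        dot += a[i] * b[i];
        normA += a[i] * a[i];
        normB += b[i] * b[i];
    }
    const denom = Math.sqrt(normA) * Math.sqrt(normB);
    return denom === 0 ? 0 : dot / denom;
}
```

At query time the notes with the highest similarity scores are the candidates to include in the LLM's context.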

apiRoute(GET, "/api/embeddings/index-rebuild-status", embeddingsRoute.getIndexRebuildStatus);

// LLM chat session management endpoints
apiRoute(PST, "/api/llm/sessions", llmRoute.createSession);
Member

aren't these missing the CSRF/auth middleware, or am I missing something?
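A dependency-free sketch of the middleware chaining being asked about, just to illustrate the ordering (the `checkApiAuth` below is a stand-in, not Trilium's actual implementation):

```typescript
// Minimal middleware-composition model: each middleware either calls
// next() or short-circuits with a response.
type Handler = (req: { authed: boolean }, next: () => string) => string;

const checkApiAuth: Handler = (req, next) =>
    req.authed ? next() : "401";

function withMiddleware(
    middleware: Handler[],
    handler: () => string,
): (req: { authed: boolean }) => string {
    return (req) => {
        const run = (i: number): string =>
            i < middleware.length
                ? middleware[i](req, () => run(i + 1))
                : handler();
        return run(0);
    };
}
```

The point is simply that the unauthenticated `/api/llm/...` routes could reuse the same `[auth.checkApiAuth, csrfMiddleware]` chain the Ollama route below already uses.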

apiRoute(PST, "/api/llm/index/notes/:noteId", llmRoute.indexNote);

// Ollama API endpoints
route(PST, "/api/ollama/list-models", [auth.checkApiAuth, csrfMiddleware], ollamaRoute.listModels, apiResultHandler);
Member

Same here. Generally speaking, I think it would be cleaner to have these under /api/llm/ if they are LLM-specific.

Apart from the above, I also wonder: wouldn't it make sense to have the route be list-models/${LLM} instead? At least from a REST point of view, I feel this would "group" these more logically.


const { note } = notes.createNewNote({
parentNoteId: rootNoteId,
title: title || 'New Chat ' + now.toLocaleString(),
Member

does it make sense to have the "default fallback string" translatable?

parentNoteId: 'root',
title: 'AI Chats',
type: 'text',
content: 'This note contains your saved AI chat conversations.'
Member

should these be translatable?

@@ -0,0 +1,433 @@
/**
* Helper functions for processing code notes, including language detection and structure extraction
Member

just a general question: what is the purpose of this code_handlers exactly – I can see some attempt to extract structure from code notes – aren't all those LLM models "smart enough" to do that themselves?

Again – not a LLM expert, so excuse me if that is a stupid question :-)

Member Author

Unfortunately not - all that LLMs “see” is text, so we have to build as much of a text representation of the Note (regardless of type) as we can.

Member Author

Perhaps for the code note type specifically, yes, we can provide the vast majority of the Note content directly to the LLM we're talking to – we'd just have to clean up any tags to minimize the impact on the size of the context.

Member Author

We’re performing those actions on code notes to try to extract what’s useful to minimize impact on the size of the context.
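The kind of cleanup described above could look like this hypothetical helper, which strips markup and enforces a rough character budget before the note content reaches the LLM:

```typescript
// Strip HTML tags, collapse whitespace, and truncate to a character
// budget. Hypothetical sketch; the PR's actual pipeline may differ.
function toLlmText(noteHtml: string, maxChars = 4000): string {
    const text = noteHtml
        .replace(/<[^>]*>/g, " ")  // drop tags (good enough for this sketch)
        .replace(/\s+/g, " ")      // collapse runs of whitespace
        .trim();
    return text.length > maxChars ? text.slice(0, maxChars) + "…" : text;
}
```

A character budget is only a proxy for tokens; a real implementation would likely use the provider's tokenizer to measure context impact.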


// Add note about truncation if needed
if (childNotes.length > maxChildren) {
context += `... and ${childNotes.length - maxChildren} more child notes not shown\n`;
Member

should be translatable, please :-)


3 participants